This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



THIS PAGE BLANK (uspto) 



per 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PC7) 



(51) International Patent Classification 5 : 
C07H 15/12, C07K 13/00, 15/00 
A61K 39/00, C12N 15/00 
C12P 21/00 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 92/11272 

9 July 1992 (09.07.92) 



(21) International Application Number : PCT/US9 1 /09630 

(22) International Filing Date: 20 December 1991 (20.12.91) 



(30) Priority data: 

631,659 



20 December 1990 (20.12.90) o J 



(71) Applicant: IXSYS, INC. [US/US]; 3550 General Atomics 

Court, Suite LI03, San Diego, CA 92121 (US). 

(72) Inventor: HUSE, William, D. ; 471 Avenida Primavera, 

Del Mar, CA 92014 (US). 

(74) Agents: CAMPBELL, Cathryn et al.; Pretty, Schroeder, 
Brueggemann & Clark, 444 South Flower Street, Suite 
2000, Los Angeles, CA 90071 (US). 



(81) Designated States: AT (European patent), AU, BB, BE 
(European patent), BF (OAPI patent), BG, BJ (OAPI 
patent), BR, CA, CF (OAPI patent), CG (OAPI patent), 
CH (European patent), CI (OAPI patent), CM (OAPI 
patent), CS, DE (European patent), &K (turopean pa- 
tent), ES (European patent), FI, FR (European patent), 
GA (OAPI patent), GB (European patent), GN (OAPI 
patent), GR (European patent), HU, IT (European pa- 
tent), JP, KP, KR, LK, LU (European patent), MC (Eu- 
ropean patent), MG, ML (OAPI patejit), MN, MR (OA- 
PI patent), MW, NL (European patent), NO, PL, RO, 
SD, SE (European patent), SN (OAPI patent), SU + ,TD 
(OAPI patent), TG (OAPI patent). 



Published 

With international search report. 



(54) Title: OPTIMIZATION OF BINDING PROTEINS 



(57) Abstract 



The invention relates to methods for manipulating nucleic acids so as to optimize the binding characteristics of an encoded 
binding protein by providing two or more nucleic acids encoding binding proteins having at least one set of splicing sites, the set 
of splicing sites flanking opposite ends of one or more encoded binding domains; mixing the nucleic acids to produce a parent 
population of mixed nucleic acids encoding binding proteins; and randomly incorporating the binding domains between the nuc- 
leic acids through the set of splicing sites to produce a different population of nucleic acids encoding binding proteins wherein at 
least one binding protein is characterized by substantially different binding characteristics than a member of the parent popula- 
tion. 



See back of page 



t 



+ DESIGNATIONS OF "SU" 

Any designation of «SU" has effect in the Russian Federation. It is not yet known whether any such 
designation has effect in other States of the former Soviet Union. , ^^^^^ 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identic Sou* party to the PCT on the front pages of pamphiets publishing international 

applications under the PCT. 



AT 


Austria 


AU 


Australia 


BB 


Barbados 


BE 


Belgium 


BF 


Burkina Faso 


BG 


Bulgaria 


BJ 


Benin 


Bft 


Brazil 


CA 


flanada 


CF 


Central African Repuhtic 


cc 


Congo 


CH 


Switzerland 


CI 


C6tc d'l voire 


CM 


Cameroon 


GS 


Ctecnoslov?kia 


DE 


Germany 


DK 


Denmark 



ES Spain 

Fl Finland 

FR France 

CA Gabon 

CB United Kingdom 

CN Guinea 

GR Greece 

HU Hungary 

IT Italy 

Jp Japan 

KP Democratic People's Republic 

of Korea 

KR Republic of Korea 

LI Liechtenstein 

LK Sri Lanka 

LfJ Luxembourg 

MC Monaco 



MG 


Madagascar 


ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


NL 


Nethcrlaruis 


NO 


Norway 


PL 


Poland 


RO 


Romania 


SD 


Sudan 


SB 


Sweden 


SN 


Senegal 


SU+ 


Soviet Union 


TD 


Chad 


TC 


Togo 


US 


United Slates of America 



WO 92/1 1272 



PCT/US91/09630 



OPTIMIZATION OF BI DING PROTEINS 
BACKGROUND <?P T HP. INVENTION 

This invention relates generally to ligand-protein 
binding interactions and, more particularly, to methods for 
5 the optimization of antigen-antibody binding affinity. 

Antibodies are heteromeric binding proteins generated 
by a vertebrate's immune system which bind to an antigen. 
The molecules are composed of two heavy and two light 
chains connected through disulfide bonds. Antibodies have 

10 a "Y" - shaped structure with the antigen binding portion 
being located at the end of both short arms of the Y. The 
region on the heavy and light chain polypeptides which 
corresponds to the antigen binding portion is known as the 
variable region. The binding specificity of an antibody is 

15 a composite of the antigen interactions with both heavy and 
light chain variable regions. The differences within this 
region are primarily responsible for the variation in 
binding specificities between antibodies. 

The immune system has the capability of generating an 

20 almost infinite number of different antibodies. Such a 
large diversity is generated primarily through ontological 
recombination to form the variable regions of each chain 
and through differential pairing of heavy and light chains. 
Within the framework of the variable region of each chain 

25 are domains characterized by unusually divergent 
hypervariable sequences. These hypervariable region 
sequences are in large part responsible for antibody 
diversity since they constitute the antigen binding pocket 
within the native protein. Differences in amino acid 

3 0 sequences within these regions allow for the formation of 
different antigen-antibody binding interactions. Thus, the 
hypervariable variable region sequences complement the 
antigen functional groups and, as such, are also known as 
complementarity determining regions (CDRs) . The ability to 

35 mimic the natural immune system by creating diverse 
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combinations of CDR sequences would be of great value 
because it would allow for the rapid production of 
antibodies with high affinity to essentially any desired 
antigen. Such antibodies can be used for various 
5 diagnostic and therapeutic purposes. 

Until recently, generation of antibodies against a 
desired molecule was accomplished only through manipulation 
of natural immune responses. Methods included classical 
immunization techniques of laboratory animals and 

10 monoclonal antibody production. Generation of monoclonal 
antibodies is laborious and time consuming. It involves a 
series of different techniques and is only performed on 
animal cells. Animal cells have relatively long generation 
times and require extra precautions to be taken to ensure 

15 viability of the cultures. 

A method for the generation of a large repertoire of 
diverse antibody molecules in bacteria has also been 
described, Huse et al., Science 246:1275-1281 (1989), which 
is incorporated herein by reference. The method uses the 

20 bacteriophage lambda as the vector. The lambda vector is 
a long, linear double-stranded DNA molecule. Production of 
antibodies using this vector involves the cloning of heavy 
and light chain populations of DNA sequences into separate 
vectors. The vectors are subsequently combined randomly to 

25 form a single vector which directs the coexpression of 
heavy and light chains to form antibody fragments. A 
disadvantage to this method is that the antibody affinities 
are limited to in vivo derived affinities. This difference 
in affinities is due to a specific immunological affinity 

30 maturation process known as somatic mutation. Antibodies 
derived from recombinant libraries may not have benefited 
from this process. Additionally, antibody sequences may be 
disproportionally represented within the library due to the 
cloning process itself. An inherent outcome of this event 

35 can cause selection of only moderate affinity antibodies 
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from the library if th y predominate over the high affinity 
sequences . 

There thus exists a need for a method to optimize 
antigen binding affinities which mimics the natural 
5 immunological processes and provides in vitro -derived 
antigen binding affinities different than those derived jji 
vivo- The present invention satisfies these needs and 
provides related advantages as well. ' 

SUMMARY OF THE INVENTION 

10 The invention relates to methods for manipulating 

nucleic acids so as to optimize the binding characteristics 
of an encoded binding protein by providing two or more 
nucleic acids encoding binding proteins having at least one 
set of splicing sites, the set of splicing sites flanking 
15 opposite ends of one or more encoded binding domains; 
mixing the nucleic acids to produce a parent population of 
mixed nucleic acids encoding binding proteins; and randomly 
incorporating the binding domains between the nucleic acids 
through the set of splicing sites to produce a different 
20 population of nucleic acids .encoding binding proteins 
wherein at least one binding protein is characterized by 
substantially different binding characteristics than a 
member of the parent population. 



BRIEF DESCRIPTIO N OF THE DRAWTfMS 

Figure 1 diagrams the mixing of binding domains from 
a heteromeric binding protein. Top: A heteromeric binding 
protein is encoded by two polypeptides (V H and v L ) . The 
binding pocket of each polypeptide consists of three 
distinct binding domains through and i^, through 

Lcow ( A ) Mixing of different V H CDRs with different V L 
CDRs. (B) Mixing of CDRs 1 with CDRs 2 and 3 for both 
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polypeptides. (C) Mixing of CDR 3 with CDRs 1 and 2 for 
both polypeptides. 

rfT ft T T.*n nESCRT PTTON OF THE INVENTION 

This invention is directed to simple and efficient 
5 methods to mix binding domains between binding proteins. 
The method can be used to optimize binding characteristics 
of a desired binding protein and is advantageous in thatt it 
increases the number of possible proteins capable of 
binding to a particular antigen by randomly interchanging 

10 related binding domains. Interchanging domains which 
exhibit related functions increases the possibility of 
obtaining productive differences in antigen binding 
interactions leading to substantially different binding 
interactions. The binding proteins are useful in an 

15 unlimited number of diagnostic and therapeutic procedures. 

In one embodiment, cloned populations of heavy and 
light chain antibody fragments are used to generate binding 
proteins toward a particular antigen having substantially 
different binding characteristics than the original cloned 

20 population. The antibody fragments within the original 
cloned population all exhibit binding specificity and 
affinity for the desired antigen and are thus functionally 
related. Novel antibodies are generated by randomly mixing 
different combinations of CDR sequences between the related 

25 antibodies. The CDR sequences are mixed, for example,, by 
either single-strand nucleic acid directed mutagenesis or 
by introducing a set of cleavage sites, such as restriction 
enzyme recognition sites, at locations which flank the 
desired sequences to be mixed. Each cleavage site within 

30 the set should be at opposite ends of the sequences to be 
mixed and ' the termini generated after cleaving one site 
should not be compatible for ligation with the termini 
generated at the other site. Noncompatibility of the 
termini ensures proper orientation of the mix d s quences 
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upon subsequent ligation. Th population of encoding 
nucleic acids are cleaved with a cleavage reagent, such as 
a restriction enzyme, and then re-ligated to form random 

i combinations of CDRs within the antibody fragment 

5 framework. In this way, individual CDRs within each heavy 

* or light chain can be mixed or complete sets of CDRs from 

heavy chains can be mixed with those of light chains to 
produce antibodies exhibiting substantially different 
binding characteristics toward a particular antigen than a 
10 member within the original cloned population. 

As used herein, the term "binding protein" refers to 
a protein which exhibits binding activity toward a 
particular molecule. It is understood that the term 
includes fragments of the polypeptide so long as binding 
15 function is retained. Such binding proteins can include 
single binding domain proteins as well as multi-binding 
domain proteins. Multi-binding domain proteins include 
those proteins which contain two or more discreet and non- 
contiguous regions along the primary structure of the 
20 polypeptide which together contribute to the overall biding 
activity. A specific example of multi-binding domain 
proteins includes heavy and light antibody chains since the 
binding activity of each is derived from three non- 
contiguous complementarity determining regions. Binding 
25 proteins can be monomeric or multimeric species. 
Heteromeric binding proteins are a specific example of 
multimeric binding proteins which are composed of at least 
two different subunits which together exhibit binding 
activity toward a particular ligand. It is understood that 
3 0 when referring to multimeric binding proteins that the term 
includes fragments of the subunits so long as assembly of 
the polypeptides and binding function of the assembled 
complex is retained. Heteromeric binding proteins include, 
for example, antibodies and fragments thereof such as Fab 
3 5 and F(ab , ) 2 portions, T cell r ceptors, integrins, hormone 
receptors and transmitter receptors. 
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As used herein, the term "binding domain" refers to a 
polypeptide or nucleic acid encoding a polypeptid whose 
amino acid sequence represents a specific region of a given 
protein and exhibits binding characteristics, by itself or 
5 in combination with other domains, similar to the binding 
characteristics of the native protein. The boundaries of 
the domains are not critical so long as binding activity is 
maintained. A specific example of a binding domain 
includes the complementarity determining region'within the 
•10 variable region of heavy or light chain antibody sequences. 

As used herein, the term "optimization" refers to the 
modification of the binding characteristics of a binding 
protein to achieve related binding proteins having 
substantially different binding properties as judged by any 

15 quantifiable assay system. The term includes both 
increases and decreases in affinity, avidity or 
specificity. The optimal binding characteristics will 
depend on the particular application of the binding 
protein. For example, decreasing the affinity of a high- 

20 affinity tumor antigen binding protein can be an 
optimization procedure to enable penetration of the binding 
protein into the tumor mass. . Conversely, it may be 
necessary to increase the affinity of binding proteins 
which exhibit low specificity toward a particular ligand in 

25 order to obtain optimal binding characteristics. 

As used herein, the term "splicing site" refers to a 
predetermined region within one nucleic acid which can be 
used for joining to a second nucleic acid. Methods such as 
single strand nucleic acid directed mutagenesis, 

30 hereinafter referred to as "insertional mutagenesis," and 
restriction/ligation can be used for joining the two 
nucleic acids at the splicing site. The splicing sites are 
typically homologous enough to allow hybridization between 
complementary strands. For example, insertional 

35 mutagenesis and restriction/ligation using nzymes which 
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produce single strand overhangs use splice sites which can 
hybridize between two complementary strands. 
Alternatively, splice sites contained within two nucleic 
acids can be joined without having sequence homology 
5 between complementary strands. An example of such a splice 
site is a restriction enzyme site which produces fclunt, 
double-stranded ends. 

As used herein, the term "set of splicing ^ites" 
refers to two or more splicing sites. In its simplest 

10 form, a set is a pair of splicing sites; however, a set can 
be any odd or even number of splicing sites which are used 
for joining nucleic acid segments. Three splicing sites is 
a specific example of an odd number of sites which make up 
a set. In this case, one of the splicing sites can be 

15 shared with each of the other two sites for independent or 
simultaneous use in joining nucleic acids. Replacement of 
adjacent CDR regions can be performed, for example, using 
a set of splicing sites which has two members flanking both 
CDR regions and a third member between the two CDR regions. 

20 As used herein, the term "cleavage site" refers to a 

location within a nucleic acid. in which the phosphodiester 
backbone is capable of being specifically broken. Cleavage 
sites can be, for example, restriction enzyme recognition 
sites such as Bst XI, Fok I and Bgl I. Also included 

25 within this definition are chemical cleavage sites. 

As used herein, the term "cleaving reagent" refers to 
the means by which a cleavage site is cleaved. For 
example, if the cleavage site is a restriction enzyme 
recognition site then the cleaving reagent is a restriction 
30 enzyme. Cleaving reagents which leave ends which are 
"noncompatible for ligation" as used herein, are reagents 
which produce termini containing noncomplementary singl 
strands or termini which are capable of being modified to 
contain noncomplementary single strands protruding from 
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each end of a duplex nucleic acid. Any non-isoschizomeric 
enzyme pair can be used as well as an enzyme which 
recognizes the same site within the nucleic acid but 
cleaves the phosphodiester bonds outside of the recognition 
5 sequence or at an undefined sequence within the recognition 
site. Examples of such enzymes include Bst XI , Fok I and 
Bgl I. 

The invention provides a method for the optimization 
of binding proteins. The method includes: (a) providing 

10 two or more nucleic acids encoding binding proteins having 
at least one set of splicing sites, said set of splicing 
sites flanking opposite ends of one or more encoded binding 
domains; (b) mixing said two or more nucleic acids to 
produce a parent population of mixed nucleic acids encoding 

15 binding proteins; and (c) randomly incorporating said 
binding domains between said two or more nucleic acids 
through said set of splicing sites to produce a different 
population of nucleic acids encoding binding proteins 
wherein at least one binding protein is characterized by 

20 substantially different binding characteristics than a 
member of the parent population. 

The invention will be described in reference to its 
presently preferred embodiments the optimization of 
binding affinity of antibody fragments, such as Fab and Fv, 

25 for a particular antigen. However, because the principles 
are the same for all binding proteins, the methods 
described herein can be used for the optimization of 
binding proteins other than antibodies without departing 
from the spirit of the invention. Binding proteins other 

30 than antibody fragments can include, for example, T cell 
receptors, integrins, hormone receptors and transmitter 
receptors, steroid-like hormones and DNA binding proteins. 

The antigen binding domains within antibody molecules 
are known as complementarity determining regions (CDRs) and 
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are located within the variable region of both the heavy 
and light chain polypeptides. There are three CDRs within 
each variable region (Figure 1, top) . These domains 
constitute the majority of the antigen binding interactions 
5 when properly folded into a native structure. CDRs are 
typically about 10 to 15 amino acid residues in length and 
are noncontiguous along the primary structure of the 
polypeptide backbone. 

Because CDRs constitute the majority of the binding 
10 interactions between antibodies and antigens, these 
sequences can be mutated or interchanged with other 
different CDRs to generate new antibodies with 
substantially different binding characteristics for a 
particular antigen than that of the parent molecules. 
15 Optimization of binding interactions by interchanging 
different but related domains within an antigen binding 
protein is advantageous when a series of functionally 
related binding proteins are available to a particular 
antigen. Different combinations of such domains are more 
20 likely to yield productive changes in binding 
characteristics than random sequence changes without any 
restrictions on function. Therefore, binding domains which 
recognize the same antigen in a similar fashion as well as 
domains which recognize the same antigen in different ways 
25 can be randomly mixed with their cognate domains to produce 
novel combinations yielding substantially different binding 
characteristics. Mixing is the act of changing the order 
of binding domains by mixing together. 

An example of mixing binding domains within 
3 0 functionally related proteins is to randomly produce all 
combinations of CDRs within and between heavy or light 
chains of related antibodies. Such random mixing can be 
accomplished at several 1 vels depending on whether whole 
sets of CDRs, subsets or individual CDRs are used for the 
3 5 process. First, entire sets of CDRs (CDRs 1 through 3) 
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within a heavy or light chain can be randomly mixed with 
entire sets of CDRs on the opposite chain. This mixing in 
effect produces new antibody fragments by pairing different 
heavy chains with different light chains. Second, subsets 
5 (e.g., two CDRs) or individual CDRs can be randomly mixed 
to produce similar antibodies with different binding 
characteristics due to the substitution of a fraction of 
the binding domains within a multi-binding domain protein. 
Third, antibodies selected for differential binding 
10 characteristics due to CDR mixing can be further optimized 
by selected mutagenesis. 

The mixing of CDRs can be accomplished by a variety of 
means. For example, insertional mutagenesis can be used to 
randomly mix binding domains within a parent population of 

15 two or more nucleic acids. One or more nucleic acids 
within the population can be used as templates for 
mutagenesis. The templates can be single strand nucleic 
acids produced by ordinary skills using vectors such as 
those derived from M13. Domains to be randomly mixed with 

20 the template population can be synthesized using, for 
example, polymerase chain reaction (PCR) with splicing 
sites at each terminus which are complementary to splicing 
sites flanking the binding domain (s) on the template 
nucleic acid. The source of template material for PCR can 

25 be, for example, mRNA or cloned populations of cDNAs. 
Splicing sites complementary to flanking sequences insures 
proper orientation of the inserted sequences. 
Alternatively, chemical synthesis methods Cc.n be employed 
to generate the binding domains to be mixed and their 

30 corresponding splicing sites. 

Synthesized binding domains are denatured, for 
example, with heat or alkali to produce single strands if 
they are in double strand form and annealed with templates. 
Hybridization through complementary sequences is very 
35 specific and therefore advantageous for joining two nucleic 
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acids together at predetermined sites. The criteria for 
determining where splicing sites should be located is 
described below with reference to splicing sites which are 
cleavage sites. The annealed splicing sites can be used as 
5 substrates for primer extension to produce double-stranded 
nucleic acids. If the annealed splicing sites are 
relatively large and able to maintain a stable duplex, the 
annealed population of nucleic acids can be directly 
introduced into a host organism. Biosynthetic machinery 

10 within the organism is capable of producing double-stranded 
nucleic acids. Populations of nucleic acids encoding 
binding proteins produced in such a way will have their 
binding domains randomly incorporated within the parent 
population of binding proteins to produce a different 

15 population of nucleic acids where at least one encoded 
binding protein is characterized by substantially different 
binding characteristics than a member of the parent 
population. 

Mixing of CDRs can also be accomplished by introducing 

20 sets of splicing sites, such as cleavage sites, at 
predetermined locations within the nucleic acids encoding 
the binding proteins. This .is shown schematically in 
Figure 1 where the sets of cleavage sites are represented 
as A and A 1 , B and B' or C and C . The cleavage sites can 

25 be, for example, restriction enzyme recognition sequences. 
Such recognition sequences can be introduced into the 
nucleic acids by methods well known in the art. Site- 
directed mutagenesis as described by Kunkle et al., Proc. 
Natl. Acad. Sci. U.S.A., 82:488-492 (1985), which is 

3 0 incorporated herein by reference, is one such method. The 
cleavage sites should be selected so that the termini 
produced at the two ends after cleaving are not compatible 
for subsequent ligation. Noncompatible termini ensures 
that the fragments will come together in the proper 

3 5 orientation aft r mixing of CDR sequences. For example, if 
a vector encoding heavy (V H ) and light (V L ) chain fragments 
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(Figure 1, top) is cleaved with cleaving reag nt A, the 
termini produced will be A and A' . A termini are 
compatible with other A termini but not with A * termini. 
Upon subsequent ligation, only As will be joined with other 
5 As and similarly, A's with other A's, thus reconstructing 
the original orientation of the V H and V L fragments. Sets 
of cleavage sites should also be selected so that they are 
uniquely found at their preselected location. Redundant 
cleavage sites encoded at other locations within the vector 
10 or coding region of the binding proteins will also be 
cleaved during mixing. Cleavage at these undesirable 
redundant sites will generate extra fragments which will 
essentially be eliminated during the mixing process. 

Splicing sites selected from restriction enzyme 

15 recognition cleavage sites and sequences having 
complementary homology are advantageous to use for the 
optimization of binding affinities, however, alternative 
sites can also be used. For example, chemical cleavage 
methods known to one skilled in the art can be used which 

20 specifically cleave at predetermined locations. 
Restriction enzyme recognition sites, however, are well 
characterized and their cleaving reagents, restriction 
enzymes, are commercially available' at inexpensive prices. 
Several restriction enzymes exist which recognize a 

25 specific sequence but cleave the nucleic acid outside of 
the recognition sequence or at an undefined sequence within 
the recognition site. This mechanism offers an additional 
advantage in that the same enzyme can be used to produce 
different termini at two different cleavage sites. For 

30 example, Fok I is a restriction enzyme which recognizes the 
sequence 5«-GGATG-3' but cleaves the phosphodi ester 
backbone of double-stranded DNA nine and thirteen bases 
downstream for the top and bottom strands, respectively. 
Therefore, the sequences at the distal cleavage sites will 

35 determine ligation compatibility and not the invariant 
recognition sequence. 
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The cl avage sites introduced into nucleic acids 
encoding the binding proteins should be positioned such 
that "they flank opposite ends of the binding domains to be 
mixed. The sites should also be positioned at locations 
5 which are non-essential for function of either the binding 
protein, expression sequences or vector sequences. If a 
cleavage site is to be placed within an essential coding 
region, then the site should be chosen so as to allow only 
conservative substitutions of amino acid Residues. 

10 Splicing sites using complementary homology should be 
chosen such that the sequence is sufficiently homologous 
within all members of the population to allow 
hybridization. For the mixing of entire sets of V H CDRs 
with sets of V L CDRs, the cleavage sites should be located 

15 outside of all three CDRs for each chain. In Figure 1, 
top, the pair of cleavage sites A and A 1 depicts this 
arrangement for adjacent V H and V L sequences. The A 
cleavage site is located between the V H CDR3 (Hq^) and the 
V L CDR1 (Lcd R1 ) • The A 1 cleavage site is located on the 

20 other side of the vector between H CDR1 and 1^3. The exact 
positioning of cleavage site sets can vary depending on the 
organization of the domains to be mixed. One skilled in 
the art will know where to introduce the sets of cleavage 
sites or determine the appropriate complementary sequence 

25 so as to flank the binding domains and allow mixing. 

Two or more nucleic acids encoding V H and V L antibody 
chains can be mixed to produce a random population encoding 
all the domains to be mixed. Mixing of V H CDRs with V L CDRs 
using cleavage sites as splicing sites is performed by 

3 0 cleaving the mixed population of nucleic acids with 
cleaving reagent A (Figure 1A) and then ligating the 
mixture to reform random combinations of V H CDRs with V L 
CDRs. Antibodies which exhibit differential binding 
affinities compared to the parent molecules can be selected 

35 by expressing the new population of ncoding nucleic acids 
in a procaryotic or eucaryotic expr ssion system using 
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methods well known in the art. 

Nucleic acids encoding the binding proteins can be 
cloned into any of a variety of vectors well known in the 
art. Alternatively, the nucleic acids can be initially 
5 cloned into a vector which enables both the optimization 
procedures described herein and the expression of 
polypeptides which constitutes the binding protein (Figure 
l, top) . For heteromeric binding proteins, coexpression- of 
two or more nucleic acid sequences is generally performed 

10 to produce separate polypeptides. The separate 

polypeptides self-assemble post- or co-translationally to 
form the functional molecule. Coexpression can also be 
accomplished, for example, by expressing the subunit 
polypeptides as a single chain which are linked together 

15 and are still capable of assembling into an heteromeric 
binding protein. A specific example of the coexpression of 
linked polypeptides is the expression of heavy and light 
chain antibody fragment polypeptides with a flexible 
peptide or polypeptide linker joining the two subunits into 

20 a single chain. The linker is flexible enough to allow 
association of heavy and light chain portions into a 
functional Fab fragment, U.S. -Pa.tent No. 4,946,778, which 
is incorporated herein by reference 1 . 

Vectors which can be used for coexpression are known 
25 or can be constructed by those skilled in the art. 
Likewise, vectors which can be used for the expression of 
monomeric binding proteins are known or can be constructed 
by those skilled in the art. Such vectors should contain 
all expression elements necessary for the transcription, 
30 translation, regulation and sorting of the polypeptides 
which make up the binding proteins. Such vectors may also 
include mechanisms for recovery in a different form. 
Phagemids are a specific example of this because they can 
be used either as plasmids or as phage vectors. The 
35 vectors can be for use ither in procaryotic or eucaryotic 
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host systems so long as th expression lements are of a 
compatible origin. One of ordinary skill in the art will 
know which host systems are compatible with a particular 
vector. Thus r the invention provides vectors, host cells 
5 transformed with the vectors and binding proteins produced 
from the host cells which contain nucleic acids encoding z 
binding protein produced by the above method for the 
optimization. 

For the mixing of individual CDRs or subsets of CDRs, 
10 a strategy identical to that described above for mixing 
whole sets of CDRs can be employed. For example, CDR 1 can 
be randomly mixed with CDRs 2 and 3 in both V H and V L 
encoding portions by using splicing sites between CDRs 1 
and 2 of both V H and V L . The two splicing sites introduced 
15 constitute the set of sites which can be used for the 
mixing process and are depicted in Figure IB as sites B and 
B 1 . The set of splicing sites therefore flanks H CDR1 and 
since the binding domains in between are those which are to 
be mixed. Nucleic acids having splicing sites between CDRs 

2 0 1 and 2, for example, can be mixed to produce a random 

population. Sequences between the splicing sites are 
randomly incorporated among fchs population by, for example, 
insert ional mutagenesis or restriction/ligation. For 
insertional mutagenesis, the population of nucleic acids 
25 should first be annealed with chosen template molecules. 
For restriction/ligation, cleaving the nucleic acids first 
at position B and B' will release the binding domains to be 
mixed (H^, H^, L^; Figure IB). Subsequent ligation of 
the mixed and cleaved population brings together random 

3 0 combinations of CDR1 with CDRs 2 and 3 for both the V H and 

V L chains. The novel combinations of binding domains can be 
screened by expression for productive changes in binding 
affinity as described above. Such methods of screening 
include, for example, ELISA, immunoblots, panning, plaque 
3: lifts, equilibrium dialysis and comp tition assays. Thes 
methods are well known to one skilled in the art and are 
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described, for example, in Harlow et al., Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor (1988) , which is incorporated herein by 
reference. 

5 Binding domains other than CDR1 can also be mixed. 

For example, CDR3 can be mixed with CDRs 1 and 2 using the 
methods described herein. By using splicing sites between 
CDRs 2 and 3 in both the V H and V L sequences, 'the binding 
domains internal to these sites can be mixed (Figure 1C) . 
10 Populations of nucleic acids encoding binding proteins with 
related functions can be mixed, treated with cleaving 
reagent C and subsequently re-ligated to produce random 
combinations of binding domains. Alternatively, templates 
and synthesized binding domains having splicing sites 

15 located at C can be mixed with random incorporation of the 
binding domains by insertional mutagenesis. Similarly, 
CDR2 can be randomly mixed with CDRs 1 and 3. For example, 
a set of splicing sites containing three members can be 
used and is advantageously performed by insertional 

20 mutagenesis. Two of the splicing sites can be located 
external to CDR2 on both the heavy and light chain 
sequences. The third splicing site can be located between 
the CDRs, for example, within the Sequences spanning 
and I^oRi- Likewise, adjacent CDRs can be mixed by having 

25 splicing sites located externally and a common site located 
between them. Annealing of nucleic acids having splicing 
sites with the above described organization will result in 
the random incorporation of, for example, H^j and L tDR2 
within the population. Additionally, there are some 

30 bacterial strains which exist, such as DH10B (Life 
Technologies, Bethesda, MD) which demonstrate the 
occurrence of DNA repair functions before DNA replication. 
In these strains, approximately equal frequencies of 
sequences corresponding to the synthesized nucleic acids 

35 and template nucleic acids will be obtained. Thus numerous 
different templates can be used with numerous different 
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synthesized nucleic acids to obtain all possible 
permutations independently of ach other. 

The binding proteins selected which exhibit optimized 
binding characteristics compared to the unmixed, parent 
5 molecules can be further optimized by selected mutagenesis. 
Although the combinations obtained after mixing of related 
binding domains can provide binding proteins having 
beneficial characteristics due to differential ahtigen 
binding characteristics, those molecules selected may not 

10 necessarily encode the best sequences required for optimal 
antigen binding properties. The selected sequences are 
simply the optimal sequences which can be obtained from the 
starting parent molecules. Selected mutagenesis 

incorporating random amino acid changes, or a desired bias 

15 of amino acid changes based on the known sequence, can 
further provide productive changes which yield more 
beneficial binding characteristics. 



Mutagenesis can be performed by standard methods in 
the art using oligonucleotides designed to incorporate 

20 specific changes at predetermined positions within the 
encoding nucleic acids. . Alternatively, the 

oligonucleotides used for mutagenesis can be synthesized by 
a method which produces diverse amino acid codons but is 
biased toward a predetermined sequence; the predetermined 

25 sequence being one or all of the binding domains within an 
optimized binding protein. This method of synthesis is 
advantageous to use for creating mutant oligonucleotides 
when a significant number of amino acid codon positions are 
to be of a specified sequence. The method is disclosed in 

30 detail in U.S. Patent application Serial No. 07/573,648 
which is explicitly incorporated herein by reference. 
Briefly, the method entails oligonucleotide synthesis using 
two reaction columns. In the first column, the nucleotide 
sequence of the specified codon is sequentially 

35 synthesized. In the second column, a degenerate codon is 
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synthesized by coupling a mixture of all four nucleotides 
at the first and second monomer positions of the codon. 
The codon is completed by coupling a mixture of either 
guanine and thymine or cytosine and adenine nucleotides at 
5 the third monomer position. Mixing the reaction products 
of the two columns yields a population of oligonucleotides 
containing both the predetermined sequence and the random 
sequence at the desired position. Synthesis can proceed by 
using .this mixture for coupling additional predetermined 

10 sequences, or, by further dividing into two reaction 
columns for synthesis of additional random sequences. By 
using such mutant oligonucleotides, changes can be made in 
the binding domains and the resultant binding proteins can 
be selected for optimized binding characteristics using 

15 standard recombinant expression methods as described above. 

The following examples are intended to illustrate 
but not limit the invention. 



EXAMPLE I 
Systematic Mixing of CD Rs for the 
20 optimization of An tigen Binding 

This example shows the optimization of Fab binding 
affinities through successive rounds of mixing 
V H and V L CDRs and selection of molecules demonstrating 
higher affinities for human B-endorphin than those Fab 
25 fragments obtained in previous rounds. 

fneorporation of Restriction Sites for CDR Mixing 

Fab fragments selected from recombinant cell surface 
expression libraries and having affinity for the human B- 
endorphin peptide were used for optimization protocols. In 
30 this system, which is described in detail in U.S. Patent 
application Serial No. 07/590,219 and incorporated herein 
by reference, heavy and light chain sequences are 
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coexpressed on the surface of filamentous bacteriophage M13 
from a dicistronic op ron under the transcriptional control 
of the Lac promoter/ operator . Unless otherwise stated, all 
recombinant methods performed for routine manipulations, 
5 such as phage culturing, vector isolation, DNA restriction, 
ligation and transformation and the like, were standard 
methods known to one skills in the art. Such methods can 
be found in SanBrook et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, 'Cold 
10 Spring Harbor, 1989, and in Ausubel et al., Current 
Protocols in Molecular Biology, John Wiley and Sons, New 
York, 1989, both of which are incorporated herein by 
reference. 



To obtain the restriction sites used for mixing CDRs, 

15 mutagenesis was performed where necessary to give the 
restriction patterns diagrammed in Figure 1. "A" and "A ,M 
restriction sites correspond to Bst XI, "B" and M B ,H 
correspond to Dra III and M C" and "C" correspond to Fok I. 
Only three sites, corresponding to A, A 1 and B 1 in Figure 

20 1, were necessary to incorporate by mutagenesis since the 
remaining sites are found naturally in most heavy and light 
chain sequences. However, Dra III and a Fok I sites were 
removed from the vector using the' mutant oligonucleotides 
5 ' -CGATGGCCCTCTACGAGAACCATC-3 ' AND 5 ' - 

25 ATAATATCCCAACCTAATTTAC-3 ' . Mutagenesis is performed using 
the method of Kunkel et al., Meth. Enzymol. 154:367-382 
(1987), which is ' incorporated herein by reference. The 
reagents strains and protocols were obtained from a Bio Rad 
Mutagenesis Kit (Bio Rad, Richmond, CA) and mutagenesis is 

3 0 performed as recommended by the manufacturer. Briefly, an 
overnight culture of CJ236 is inoculated (1 ml) into 50 uls 
of SOB-CAP and grown to OD^q = 0.3. Phage containing the 
template to be mutagenized are diluted 1:100 and 10 ill is 
added to the culture. Incubation is continued at 37 °C for 

35 an additional 4 hours. Cultures are centrifuged twice at 
12,000 rpm for 15 minutes at 4'C and the supernatant is 
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transferred to a new tube. RNase A is added (15 fil of 10 
mg/ml) followed by 7.5 mis of 20% PEG 8000 in 3.5 M NH 4 OAc. 
The suspension is incubated on ice for 30 minutes and 
centrifuged once as described above. The pellet is 
5 resuspended in. 200 fil high salt buffer (300 mM NaCl, 100 mM 
Tris, pH 8.0, 1 mM EDTA) and filtered on both XLl-Blue 
(Stratagene, La Jolla, CA) and on GT236. CJ236 phage 
stocks are extracted with an equal volume of 
phenol/chloroform/isoamyl alcohol (1:1:1/48) followed' by 
• io back extraction with dH 2 0. These uracil-containing nucleic 
acids are ethanol precipitated, washed, resuspended in 20 
Hi dH 2 0 and quantitated. The uracil-containing template 
(233 fig) is mixed with 20 ng (in 1 fil) of phosphorylated 
mutant oligonucleotide and 1 Ml of 10 X annealing buffer 
15 (200 mM Tris, pH 7.4, 20 mM MgCl 2 , 500 mM NaCl) in 10 til 
final volume, heated to 70 *C and slow cooled to room 
temperature. The reactions are placed on ice and 1 fil of 
10 X synthesis buffer (4 mM each dNTP, 7.5 mM ATP, 175 mM 
Tris, pH 7.4, 37.5 mM MgCl 2 , 215 mM DTT) , 1 Hi T4 DNA ligase 
20 (3-5 units) and 1 fil T4 Dna polymerase is added. The 
reactions are incubated for 5 minutes on ice. followed by 90 
minutes at 37 *C. Reactions are stopped by addition of 90 
jil of stop buffer (10 mM Tris, . pH 8.0, 10 mM EDTA) and 
transformed into XLl-Blue cells. ' 

25 The sequences of the mutant oligonucleotides are given 

in Table I where the underlined portion indicates the 
incorporated restriction enzyme site. The oligonucleotides 
correspond to the non-coding strand of the V h , V t or vector 
sequence. All mutations are confirmed by restriction 

30 enzyme digestion and sequence analysis. Sequencing is 
performed by the method of Sanger et al., Proc. Natl. Acad. 
Sci. U.S.A. 74:5463-5467 (1977), using Sequenase* version 
2.0 DNA sequencing kit as recommended by the manufacturer. 
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TABLE I 

^^r^ T^i^ sequent used for Kntagepesjs 

Incorporated fn , M 
m^nn^eotide strict jpn Site p pqn- nn- f^o 3 ) 

Bst XI 5'- CGACGTTGTAAAACC 

A AGGCCAGTGGCAAGTGA 

CG-3 I , 

, Bst XI 5 '- CCACAAGAATGGAG 

A TTAAGCCC-3 ' 



20 



B' 



15 C« 



Dra III 5 1 - TGGTACCACCAGG 

TGGGACAGCCTAGGCTG 
TCCCACCTGGTGGTAC 
CA-3 ' 

Fok j 5 ' - ATAAACTCCAACAT 

CCTCAGC-3 1 



^ ^j^t-ion of piling Affinity by CDR Mixing 

For optimization of Fab binding affinities, three 
successive rounds of CDR mixing is performed. Each round 
of mixing expands the number of .combinations which can be 
tested by using the newly generated antibodies in different 
CDR combinations with themselves and with the original set 
of selected Fab fragments. 

For the first set of combinations, heavy chain CDRs 
25 are mixed with different light chain CDRs between 
antibodies exhibiting antigen binding affinity for human B- 
endorphin. This mixing efficiently allows testing of 
different heavy chain antigen binding sequences with 
different complementary light chain antigen binding 
30 sequences. 

For the mixing of heavy and light chain CDRs, the 



WO 92/11272 



PCT/US91/09630 



mutagenized vectors described above encoding the antibody 
sequences were digested with Bst XI. Equal amounts of each 
vector is combined to yield a final amount between 5 and 10 
fig of total DNA. The digestions are checked for completion 
5 by aganose gel electrophoresis. Reactions are stopped by 
extracting once with an equal volume of phenol/chloroform, 
once with chloroform, followed by ethanol precipitation. 
The pellets are washed with 70% ethanol, dried under vacuum 
and resuspended in 8.5 /il distilled or deionized water 
, 10 (dH 2 0) . One /il of 10 x ligase buffer (50 mM Tris-HCl, pH 
7.8, 10 mM MgCl 2 , 20 mM DTT, 1 mM ATP, 50 fig/ml BSA) is 
added, followed by 0.5 jil of T4 DNA ligase (2 U//il; 
Bethesda Research Laboratories, Gaithersburg, MD) . The 
ligations are incubated at 16 *C overnight or for 2 hours at 
15 room temperature prior to electroporation into a compatible 
host strain. 

E» coli MK30-3 is electroporated as described by Smith 
et al., Focus 12:38-40 (1990) which is incorporated herein 
by reference. The cells are prepared by inoculating a 
20 fresh colony of MK30-3 into 5 mis of SOB without magnesium 
(20 g bacto-tryptone, 5 g bacto-yeast extract, 0.584 g 
NaCl, 0.186 g KC1, dH 2 0 to 1,000 mis) and grown with 
vigorous aeration overnight at 37 *C. 1 SOB without magnesium 
(500 ml) is inoculated at 1:1000 with the overnight culture 
25 and grown with vigorous aeration at 37 *C until the OD 550 is 
0.8 (about 2 to 3 h) . The cells are harvested by 
centrifugation at 5,000 rpm (2,600 x g) in a GS3 rotor 
(Sorvall, Newtown, CT) at 4'C for 10 minutes, resuspended 
in 500 ml of ice-cold 10% (v/v) sterile glycerol, 
30 centrifuged and resuspended a second time in the same 
manner. After a third centrifugation, the cells are 
resuspended in 10% sterile glycerol at a final volume of 
about 2 ml, such that the OD 550 of the suspension was 200 to 
300. Usually, resuspension is achieved in th 10% glycerol 
35 that remained in the bottle after pouring off the 
supernate. Cells are frozen in 40 nl aliquots in 
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microcentrifuge tubes using a dry ice-ethanol bath and 
stored frozen at -70' C. 

Frozen cells are electroporated by thawing slowly on 
ice before use and mixing with about 10 pg to 500 ng of 
5 vector per 40 /xl of cell suspension. A 40 pi aliquot is 
placed in an 0.1 cm electroporation chamber (Bio-Rad, 
Richmond, CA) and pulsed once at 0*C using 4 kfl parallel 
resistor 25 pF, 1.88 KV, which gives a pulse length (r) of 
"4 ms. A 10 Ml aliquot of the pulsed cells are diluted 

10 into 1 ml SOC (98 mis SOB plus 1 ml of 2 M MgCl 2 and 1 ml of 
2 M glucose) in a 12- x 75-mm culture tube, and the culture 
is shaken at 37 *C for 1 hour prior to culturing in 
selective media. Briefly, the above 1 ml library cultures 
are grown up by diluting 50-fold into 2XYT media (16 g 

15 tryptone, 10 g yeast extract, 5 g NaCl) and culturing at 
37 # C for 5-8 hours. The bacteria are pelleted by 
centrifugation at 10,000 x g. The supernatant containing 
phage is transferred to a sterile tube and stored at 4'C. 

Purified phage expressing on their surface coat the 
20 newly created Fab fragments are prepared from 50 ml liquid 
cultures of XL1 Blue cells (Stratagene, La Jolla, CA) 
which had been infected at a m.oii. of 10 from the phage 
stocks stored at 4*C. The cultures are induced with 2 mM 
IPTG. Supernatants are cleared by two centrifugations, and 

2 5 the phage are precipitated by adding 1/7.5 volumes of PEG 

solution (25% PEG-8000, 2.5 B NaCl) , followed by incubation 
at 4 # C overnight. The precipitate is recovered by 
centrifugation for 90 minutes at 10,000 x g. Phage pellets 
are resuspended in 25 ml of 0.01 M Tris-HCl, pH 7.6, 1.0 mM 

3 0 EDTA, and 0.1% Sarkosyl and then shaken slowly at room 

temperature for 30 minutes. The solutions are adjusted to 
0.5 M NaCl and to a final concentration of 5% polyethylene 
glycol. After 2 hours at 4-C, the precipitates containing 
the phage are recovered by centrifugation for 1 hour at 
35 15,000 X g. The precipitates are resuspended in 10 ml of 
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NET buffer (0.1 M NaCi, 1.0 mM EDTA, and 0.01 M Tris-HCl, 
pH 7.6), mixed well, and the phage repel leted by 
centrifugation at 170,000 X g for 3 hours. The phage 
pellets are resuspended overnight in 2 ml of NET buffer and 
5 subjected to cesium chloride centrifugation for 18 hours at 
110,000 X g (3.86 g of cesium chloride in 10 ml of buffer) . 
Phage bands are collected, diluted 7-hold with NET buffer, 
recentrifuged at 170,000 X g for 3 hours, resuspended, and 
stored at 4'C in 0.3 ml of NET buffer containing 0.1 mM 
10 sodium azide. 

Binding affinity is assessed by panning surface 
expression phage on dishes coated with human 6-endorphin. 
6-endorphin is attached to streptavidin coated dishes 
following biotinylation and absorption against phage grown 

15 in the presence of uracil in a Sup E una" du£* strain such 
as RZ1032. Phage grown in this way insures that any phage 
remaining after blocking will not survive under normal 
growth conditions and therefore will not interfere with 
panning. Alternatively, standard reverse immunoscreening 

20 methods using labeled antigen can also be used. The 
biotinylating reagents are dissolved in dimethyl formamide 
at a ratio of 2.4 mg solid NHS-SS-Biotin (sulfosuccinimidyl 
2-(biotinamido)ethyl-l,3 '-dithiopropionate; Pierce, 
Rockford, IL) to 1 ml solvent and used as recommended by 

25 the manufacturer. Small-scale reactions are accomplished 
by mixing 1 pi dissolved reagent with 43 nl of 1 mg/ml 8- 
endorphin diluted in sterile bicarbonate buffer (0.1 M 
NaHC0 3 , pH 8.6). After 2 hours at 25 *C, residual 
biotinylating reagent is reacted with 500 nl 1 K 

30 ethanolamine (pH adjusted to 9 with HC1) for an additional 
2 hours. The entire sample is diluted with 1 ml TBS 
containing 1 mg/ml BSA, concentrated to about 50 »1 on a 
Centricon 30 ultra-filter (Amicon) , and washed on the same 
filter three times with 2 ml TBS and once with 1 ml TBS 

35 containing 0.02% NaN 3 and 7 x 10 12 DV- inactivated blocking 
phage (see below); the final retentate (60-80 nl) is stored 
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at 4 # C. B-endorphin biotinylated with the NHS-SS-Biotin 
reagent is linked to biotin via a disulf ide-containing 
chain. 

For panning, polystyrene petri plates (60 x 15 mm) are 
5 incubated with 1 ml of 1 mg/ml of streptavidin (BRL) in 0.1 
M NaHCOj pH 8.6-0.02% NaN 3 in a small, air-tight plastic box 
overnight in a cold room. The next day streptavidin is 
removed and replaced with at least 10 ml blocking soltition 
(29 mg/ml of BSA; 3 /xg/ml of streptavidin; 0.1 M NaHC0 3 pH 
10 8.6-0.02% NaN 3 ) and incubated at least 1 hour at room 
temperature. The blocking solution is removed and plates 
are washed rapidly three times with Tris buffered saline 
containing 0.5% Tween 20 (TBS-0.5% Tween 20). 

Selection of phage expressing antibody fragments which 
15 bind B-endorphin is performed with 5 jil (2.7 B- 
endorphin) of blocked biotinylated B-endorphin reacted with 
a 50 Ml portion of the library. Each mixture is incubated 
overnight at 4'C, diluted with 1 ml TBS-0.5% Tween 20, and 
transferred to a streptavidin-coated petri plate prepared 
20 as described above. After rocking 10 minutes at room 
temperature, unbound phage are removed and plates washed 
ten times with TBS-0.5% Tween 20' over a period of 30-90 
minutes. Bound phage are eluted from plates with 800 Ml 
sterile elution buffer (1 mg/ml BSA, 0.1 M HC1, pH adjusted 
25 to 2.2 with glycerol) for 15 minutes and eluates 
neutralized with 48 Ml 2 M Tris (pH unadjusted) . A 20 ixl 
portion of each eluate is titered on MK30-3 concentrated 
cells with dilutions of input phage. 

A second round of panning is performed by treating 750 
3 0 Ml of first eluate from the library with 5 mM DTT for 10 
minutes to break disulfide bonds linking biotin groups to 
residual biotinylated binding proteins. The treated eluate 
is concentrated on a Centricon 30 ultrafilter (Amicon) , 
washed three times with TBS-0.5% Tween 20, and concentrated 
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to a final volume of about 50 fil. Final retentate is 
transferred to a tube containing 5-0 Ml (2.7 M9 B- 
endorphin) blocked biotinylated B-endorphin and incubated 
overnight. The solution is diluted with 1 ml TBS-0.5% 
5 Tween 20, panned, and eluted as described above on fresh 
streptavidin-coated petri plates. The entire second eluate 
(800 Ml) is neutralized with 48 nl 2 M Tris, and 20 u\ is 
titered simultaneously with the first eluate and dilutions 
of the input phage. If necessary, further rounds ' of 
* 10 panning can be performed to obtain homogeneous populations 
of phage. Additionally, phage can be plaque purified if 
reagents are available for detection. 

The phage selected by panning are characterized by DNA 
sequencing of the Fab encoding sequences. Templates are 

15 prepared for sequencing by inoculating a 1 ml culture of 
2XYT containing a 1:100 dilution of an overnight culture of 
XL1 with an individual plaque from the purified population. 
The plaques are picked using a sterile toothpick. The 
culture is incubated at 37 # C for 5-6 hours with shaking and 

20 then transferred to a 1.5 ml microfuge tube. 200 m! of PEG 
solution is added, followed by vortexing and placed on ice 
for 10 minutes. The phage pre.cipitate is recovered by 
centrifugation in a microfuge at 12', 000 x g for 5 minutes. 
The supernatant is discarded and the pellet is resuspended 

25 in 230 Ml of TE (10 mM Tris-HCl, pH 7.5, 1 mM EDTA) by 
gently pipeting with a yellow pipet tip. Phenol (200 m1) 
is added, followed by a brief vortex and microfuged to 
separate the phases. The aqueous phase is transferred to 
a separate tube and extracted with 200 m! of 

30 phenol/chloroform (1:1) as described above for the phenol 
extraction. A 0.1 volume of 3 M NaOAc is added, followed 
by addition of 2.5 volumes of ethanol and precipated at - 
20 *C for 20 minutes. The precipated templates are 
recovered by centrifugation in a microfuge at 12,000 x g 

3 5 for 8 minutes. Tha pellet is washed in 70% ethanol, dried 
and resuspended in 25 m! TE. Sequencing was performed 
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using a Sequenase™ sequencing kit following the protocol 
suppli d by the manufacturer (U.S. Biochemical, Cleveland, 
OH) . 

The second round of CDR mixing entails randomly 
5 combining different CDR combinations within heavy and light 
chain populations of tl:a selected antigen binding 
fragments. CDR 2 and 3 sequence were mixed within heavy 
and light chain populations with different CDR' 1 sequences. 
The antibodies obtained from both the heavy and light chain 
10 combinations and from the initially selected antibodies 
were used for these optimization procedures. 

Although an initial round of optimization has already 
been performed, the vectors containing selected Fab 
sequences still encode the restriction enzyme sites 
15 incorporated by mutagenesis. Therefore, equal amounts of 
each vector are combined (5-10 /xg total) and digested to 
completion with Fok I. The subsequent steps of ligation, 
electroporation and screening are performed as described 
above. 

20 The third round of CDR. mixing entails randomly 

combining CDRs 1 and 2 within 'heavy and light chain 
populations with different CDR 3 sequences. All previously 
obtained antibodies selected for binding affinity to human 
B-endorphin were mixed at equal amounts and digested with 

25 Bgl I as described previously. Similarly, the ligations 
and screening were also performed as described above. 
Quantitation of the binding affinities is performed by 
equilibrium dialysis against labeled B-endorphin and 
Schatand plot analysis. 

30 Although the invention has been described with 

reference to the presently-preferred mbodiment, it should 
be understood that various modifications can be made by 
those skilled in the art without departing from the 
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invention. Accordingly, the invention is limited only by 
the following claims. 
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I Claim: 

1. A method for manipulating nucleic acids* so as to 
optimize the binding characteristics of an encoded binding 
protein , comprising : 

(a) providing two or more nucleic acids encoding 
5 binding proteins having at least one set of 

splicing sites, said set of splicing kites 
flanking opposite ends of one or more 
encoded binding domains; 

(b) mixing said two or more nucleic acids to 
10 produce a parent population of mixed nucleic 

acids encoding binding proteins; and 

(c) randomly incorporating said binding domains 
between said two or more nucleic acids 
through said set of splicing sites to 

15 produce a different population of nucleic 

acids encoding binding proteins wherein at 
least one binding protein is characterized 
by substantially different binding 
characteristics than a member of the parent 

20 population. 

2. The method of claim 1, wherein step (a) further 
comprises synthesis of at least one nucleic acid encoding 
a binding protein by enzymatic polymerization. 

3. The method of claim 2, wherein said enzymatic 
polymerization is a polymerase chain reaction. 



4. The method of claim 1, wherein step (a) further 
comprises chemical synthesis of at least one nucleic acid 
encoding a binding protein. 
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5. The method of claim 1, wherein step (c) further 
comprises : 

(cl) introducing at least one set of cleavage 
sites within said parent population of mixed 
nucleic acids at said splicing sites; 
(c2) cleaving said cleavage sites with a cleaving 
reagent to produce termini at each splicing 
site which are non-compatible for ligation 
and releasing said one or more encoded 
binding domains from said nucleic acids 
encoding binding proteins; and 
(c3) ligating said released binding domains to 
said nucleic acids encoding binding 
proteins . 

6. The method of claim 2, wherein step (c) further 
comprises annealing single strand regions of said splicing 
sites between said two or more nucleic acids, said splicing 
sites having sufficient nucleotide sequence identity 

5 between complementary strands to allow specific 
hybridization. 

t 

7. The method of claim 6, wherein said two or more 
nucleic acids are single-stranded. 

8. The method of claim 4, wherein step (c) further 
comprises annealing single-stranded regions of said 
splicing sites between said two or more nucleic acids, said 
splicing sites having sufficient nucleotide sequence 

5 identity between complementary strands to allow specific 
hybridization. 

9. The method of claim 8, wherein said two or more 
nucleic acids are i : ingle-stranded. 
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10. The method of claim 1, wherein said binding 
proteins are antibodies. 

11. The method of claim 1, wherein said binding 
domains are CDRs. 

12. The method of claim 5, wherein said set of 
cleavage sites are restriction enzyme recognition sites. 

13. The method of claim 12, wherein said restriction 
enzyme recognition sites are selected from the group 
consisting of Bst XI, Fok I and Dra III. 

14. The method of claim 5, wherein said cleaving 
reagent is a restriction enzyme. 

15. A nucleic acid encoding a binding protein 
produced by a process, comprising: 

(a) providing two or more nucleic acids encoding 
binding proteins having at least one set of 

5 splicing sites, said set of splicing sites 

flanking opposite ends of one or more 
encoded binding domains; 

(b) mixing said two or more nucleic acids to 
produce a parent population of mixed nucleic 

10 acids encoding binding proteins; and 

(c) randomly incorporating said binding domains 
between said two or more nucleic acids 
through said set of splicing sites to 
produce a different population of nucleic 

15 acids encoding binding proteins wherein at 

least one binding protein is characterized 
by substantially different binding 
characteristics than a member of the parent 
population. 
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16. The nucleic acid of claim 15, wherein said 
binding proteins are antibodies. 

17. The nucleic acid of claim 15 , wherein said 
binding domains are CDRs. 

18. A vector containing a nucleic acid encoding a 
binding protein produced by a process, comprising: 

(a) 



(b) 
(c) 



19. The vector of claim 18, wherein said binding 
proteins are antibodies. 

20. The vector of claim 18 , wherein said binding 
domains are CDRs. 

21. A host cell transformed with the vector of claim 

18. 

22. The host cell of claim 21, wherein said binding 



providing two or more nucleic acids encoding 
binding proteins having at least one set of 
splicing sites, said set of splicing sites 
flanking opposite ends of one or more 
encoded binding domains; 

mixing said two or more nucleic acids to 
produce a parent population of mixed nucleic 
acids encoding binding proteins; and 
randomly incorporating said binding domains 
between said two or more nucleic acids 
through said set of splicing sites to 
produce a different population of nucleic 
acids encoding binding proteins wherein at 
least one binding . protein is characterized 
by substantially ' different binding 
characteristics than a member of the parent 
population. 
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proteins are antibodi s. 

23. The host cell of claim 21, wherein said binding 
domains are CDRs. 

24. A binding protein produced by the host cell of 
claim 21. 

25. The binding protein of claim 24, wherein 'said 
binding protein is an antibody. 



26. The binding protein of claim 24, wherein said 
binding domains are CDRs. 
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